Using Verb-Noun Patterns to Detect Process Inputs

نویسندگان

  • Munshi Asadullah
  • Damien Nouvel
  • Patrick Paroubek
چکیده

We present the preliminary results of an ongoing work aimed at using morpho-syntactic patterns to extract information from process descriptions in a semi-supervised manner. The experiments have been designed for generic information extraction tasks and evaluated on detecting ingredients from cooking recipes in French using a large gold standard corpus. The proposed method uses bi-lexical dependency oriented syntactic analysis of the text and extracts relevant morpho-syntactic patterns. Those patterns are then used as features for different machine learning methods to acquire the final ingredient list. Furthermore, this approach may easily be adapted to similar tasks since it relies on mining generic morpho-syntactic patterns from the documents automatically. The method itself is language independent, considering language specific parsers being used. The performance of our method on the DEFT 2013 data set is nevertheless satisfactory since it significantly outperforms the best system from the original challenge (0.75 vs 0.66 MAP). 1 Process Inputs in Specification Document We are looking for a generic method to extract clearly defined target information from specification documents written in natural language. Specification in this context is the generic class of documents written to explain how a set of input elements are to be used or intended to be used in the processes described in the document. For the purpose of clarity we shall exemplify all the formal discussion within the cooking recipe domain.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Collocational Clashes in the Persian Translations of Tuesdays with Morrie

This study aimed at finding features of collocational deviations in the translations of Tuesdays with Mor- rie. In this direction, categories of collocations and collocational clashes, as well as causes of collocation- al clashes were explored. The present work investigated five Persian translations of the novel. All the books were examined completely and all possible collocational clashes were...

متن کامل

Dependency Parsing for Identifying Hungarian Light Verb Constructions

Light verb constructions (LVCs) are verb and noun combinations in which the verb has lost its meaning to some degree and the noun is used in one of its original senses. They often share their syntactic pattern with other constructions (e.g. verbobject pairs) thus LVC detection can be viewed as classifying certain syntactic patterns as light verb constructions or not. In this paper, we explore a...

متن کامل

A New Approach to Syntax

This paper describes a new method for syntactic analysis of English. Instead of the conventional subject-predicate structure as a basis for analysis, elementary sentence patterns are used. It is observed that there are two basic sentence formats in English. One, using a transitive verb, consists of the sequence noun, verb, noun, noun. The other, using an intransitive verb, consists of the seque...

متن کامل

Detecting Noun Compounds and Light Verb Constructions: a Contrastive Study

In this paper, we describe our methods to detect noun compounds and light verb constructions in running texts. For noun compounds, dictionary-based methods and POStagging seem to contribute most to the performance of the system whereas for light verb constructions, the combination of POStagging, syntactic information and restrictions on the nominal and verbal component yield the best result. Ho...

متن کامل

بررسی تأثیر پیچیدگی نحوی ساختار گروه اسمی ‌و فعلی بر وقوع لکنت در کودکان لکنتی پیش‌دبستانی ۶-۴ سال فارسی‌زبان

Objective: The purpose of the present research was to investigate the effect of syntactic complexity of noun phrase and verb phrase on the occurrence of stuttering in 4-6 year Persian speaking children with stuttering. Materials & Methods: This descriptive-analytic research was done on 15 stuttering children, consisting of 12 boys and 3 girls, 4 to 6 years old monolingual Persian speaking wh...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2014